Extending Tree Automata to Model XML Validation Under Element and Attribute Constraints
نویسندگان
چکیده
Algorithms for validation play a crucial role in the use of XML. Although much effort has been made for formalizing the treatment of elements, attributes have been neglected. This paper presents a validation model for XML documents that takes into account element and attribute constraints imposed by a given DTD. Our main contribution is the introduction of a new formalism to deal with both kinds of constraints. To this end we propose an extension of regular tree automata that allows the construction of a deterministic automaton having the same expression power as that of a DTD. Our formalism gives rise to an efficient validation method.
منابع مشابه
Validation algorithm for attribute-element constraints of RELAX NG
Patterns of RELAX NG can represent interdependencies between attributes and elements. Such patterns are useful for schema authors. However, they make validator implementation algorithmically challenging, since naive approaches easily blow up even for typical inputs. James Clark has provided a derivative-based validation algorithm for handling such interdependencies. This paper shows another alg...
متن کاملTree Automata to Verify XML Key Constraints
We address the problem of checking key constraints in XML. Key constraints have been recently considered in the literature and some of their aspects are adopted in XMLSchema. However, only few works have appeared concerning the verification of such constraints. Unranked deterministic bottom-up tree automata can be used to validate XML documents against a schema. These automata work over (unrank...
متن کاملMultidimensional fuzzy finite tree automata
This paper introduces the notion of multidimensional fuzzy finite tree automata (MFFTA) and investigates its closure properties from the area of automata and language theory. MFFTA are a superclass of fuzzy tree automata whose behavior is generalized to adapt to multidimensional fuzzy sets. An MFFTA recognizes a multidimensional fuzzy tree language which is a regular tree language so that for e...
متن کاملLanguages Modulo Normalization
We propose a new class of tree automata, called tree automata with normalization (TAN). This framework extends equational tree automata, and improved the results of them: recognized tree languages modulo the idempotency f(x, x) = x are closed under complement, besides we do not lose an important decidability result. In the paper, first we investigate the closure properties of this class for Boo...
متن کاملFinite automata on unranked trees: extensions by arithmetical and equality constraints
The notion of unranked trees has attracted much interest in current research, especially due to their application as formal models of XML documents. In particular, several automata and logic formalisms on unranked trees have been considered (again) in the literature, and many results that had previously been shown for the ranked-tree setting have turned out to hold for the unranked-tree setting...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003